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Abstract — We investigate the MIMO broadcast channel in the 
high SNR regime when linear filtering is applied instead of 
dirty paper coding. Using a user-wise rate duality where the 
streams of every single user are not treated as self-interference 
as in the hitherto existing stream-wise rate dualities for linear 
filtering, we solve the weighted sum rate maximization problem 
of the broadcast channel in the dual multiple access channel. 
Thus, we can exactly quantify the asymptotic rate loss of 
linear filtering compared to dirty paper coding for any channel 
realization. Having converted the optimum covariance matrices 
to the broadcast channel by means of the duality, we observe that 
the optimal covariance matrices in the broadcast channel feature 
quite complicated but still closed form expressions although the 
respective transmit covariance matrices in the dual multiple 
access channel share a very simple structure. We immediately 
come to the conclusion that block-diagonalization is the asymp- 
totically optimum transmit strategy in the broadcast channel. 
Out of the set of block-diagonalizing precoders, we present the 
one which achieves the largest sum rate and thus corresponds to 
the optimum solution found in the dual multiple access channel. 
Additionally, we quantify the ergodic rate loss of linear coding 
compared to dirty paper coding for Gaussian channels with 
correlations at the mobiles. 

I. Introduction 

While the sum capacity of the single-user MIMO point- 
to-point link can be expressed semi-analytically in closed 
form [1], the simplest multi-user setup with single antenna 
terminals already allows for the presumption that this will 
remain infeasible in the broadcast and multiple access chan- 
nel irrespective of whether linear or nonlinear filtering is 
considered. Fortunately, the high signal-to-noise ratio regime 
is an exception to this deflating circumstance, since there, 
asymptotic results on the sum capacity have been discovered 
for dirty paper coding and partly for linear filtering. 

A. Literature Overview 

The single user point-to-point MIMO case was treated 
in [2], [3], where the Grant-Gauthier lower bound on the 
mutual information, that becomes asymptotically tight, was 
decomposed into a supremum capacity term, an instantaneous 
SNR effect term, and an instantaneous capacity degradation 
term due to the eigenvalue spread. Outage capacity and 
throughput of a fading point-to-point MIMO system are ana- 
lyzed in [4]. The first high-SNR sum capacity analysis of the 
point-to-multipoint broadcast channel appeared in [5], where 
single-antenna receivers were considered. Therein, the affine 
approximation of the sum capacity introduced in [6] and 
elaborately discussed in [7] was utilized. First, [5] shows that 



the single-antenna broadcast channel has the same asymptotic 
sum-capacity as the corresponding point-to-point MIMO link 
with cooperating receive antennas, and second, how the power 
offset term in the broadcast channel looks like. Furthermore, 
the instantaneous and ergodic spectral efficiency loss of linear 
zero-forcing beamforming with respect to DPC was derived 
in [5], again for single antenna receivers. The extension to 
multi-antenna receivers was presented in [8], [9], where the 
asymptotic equivalence of the nonlinear dirty paper coding 
sum capacity in the broadcast channel and the sum capacity 
of the equivalent point-to-point MIMO link with cooperating 
receivers was proven to hold in the multi-antenna case. Out 
of the class of linear precoding schemes, zero-forcing and 
block-diagonalization are considered. However, only ergodic 
statements for the asymptotic sum rate and the asymptotic 
rate loss with respect to dirty paper coding are derived, 
and a very special fading model is a key prerequisite for 
the presented results. Expressions for the instantaneous rate 
loss are not possible. Moreover, it is neither known yet, 
whether block-diagonalization is the asymptotically optimum 
transmission strategy or not in the broadcast channel when 
linear filtering is considered, nor how the optimum block- 
diagonalizing precoder looks like. 

B. Contributions 

The main contributions of this paper are summarized in the 
following list: 

1) The derivation of the maximum weighted sum rate 
asymptotically achievable with Unear filtering. 

2) A closed form expression for the asymptotic rate loss 
of linear filtering with respect to dirty paper coding for 
any antenna configuration at the base and the mobiles. 

3) A closed form solution of the covariance matrices in the 
dual uplink achieving this maximum weighted sum rate. 

4) We prove, that block diagonalization is asymptotically 
optimum in the broadcast channel. 

5) Finally, we derive the optimum precoding and transmit 
covariance matrices in the broadcast channel by means 
of our rate duality in [10]. 

II. System Model 

We consider the communication between an N antenna 
base station and K multi antenna terminals, where user k 
multiplexes Bk data streams over his 7-^ antennas. For a 
short notation, we define r as the sum of all antennas at 



the terminals, i.e., r — X]a:=i ^fe' ^^^ ^ ^^ '^he sum of 
all transmitted streams, i.e., b — X)fc=i ^k- In the MAC, 
user k applies a preceding matrix T^ G CfeX^* generating 
his rfc X rfc transmit covariance matrix Qk = T^TJ^. The 
preceded symbol vector propagates over the channel described 
by the matrix Hk £ C^^''''. At the receiver side, zero-mean 



on the covariance matrices of the other users! Consequently, 
the eigenbases of all transmit covariance matrices do not influ- 
ence the rates of the users, only the powers of the eigenmodes 
are relevant. Let the eigenvalue decomposition of Qk read as 
Qk — VfeAfc V^^ with unitary 14 and the diagonal nonnegative 
power allocation Ak. Due to the determinant operator, Vfc 
noise r] e <C'^ wifli identity covariance matrix is added and can be chosen arbitrai'ily and therefore, we set Vk = Ir^ Vfc 



the receive filter for user fc is denoted by Gk G C^''^^. 
Due to the reversed signal flow in the BC, we characterize 
the transmission from the base station to terminal fc by the 
Hermitian channel H^ in the BC, the precoder dedicated to 
the Bk streams of user fc is denoted by P^ £ C^^'^*, and 
zero-mean noise rj}. G 'C'' with identity covariance matrix is 
added at user fc. Throughout this paper, we assume that the 
base station has at least as many antennas as the terminals 
have in sum, i.e., N > r. 

III. Optimum Signalling in the Dual MAC 

Introducing the composite channel matrix H and the com- 
posite block-diagonal precoder matrix T of all K users via 

H^ [Hi,...,Hk] GC^^^ 
T = blockdiag{Tfc}f^i e C'^x^ 

the rate of user fc seeing interference from all other users can 
be expressed as (see [10]) 

Rk = log2 \lN + {lN+Y,HeQeHfy^HkQkH^ 



(1) 



^-logalls, - 



fc -"fc ^ J^kJ-k\, 



where the substitution X reads as 

K 

X = 1n+J2 HiQiHf = In+HTT^H^. 



t=i 



Reformulating the rate expression ([U, we get 

Rk = -log2 \Ej{lb-T''H''X-^HT)Ek\ 



^-\og^\Ef, 



^'\b+T^H^HT) ^Ek\ 



(2) 



where the fcth block unit matrix is defined via 

[O,...,O,Is,,O,...,O]e{0;l}^'=^^ 



Ej 



with the identity matrix at the fcth block. Due to the assumption 
that the base station has more antennas than the terminals 
in sum, all r streams can be activated leading to square 
precoders T^ with Bk = r^ Vfc. Raising Ptx, all r streams 
become active, T becomes full rank, and all eigenvalues of 
T^ H^ HT become much larger than one. In the asymptotic 
limit, we obtain 



Rk^^\og^\T^^E^{H'^H) 
= log2|Qfe| -logal-Efe 



''EkTr^\ 



(H^'H 



Ek 



(3) 



since E{.T-^ ^T^ 



•Trr-i _ rr-1 jp'i' -pjjg notation x = y means that the 
difference x ~ y vanishes when the sum power Ptx goes to 
infinity. Interestingly, the rate of user fc depends only on the 
determinant of his own transmit covariance matrix Qk, and not 



without loss of generality. Let the power allocation matrix be 
composed by the entries Ak = diag{A^ }[^]^. Due to the 
sum-power constraint, the determinant \Qk\ = |^A;| is then 
maximized by setting 

(1) ^(r^.) 



W.' = . . . = A 



:— At 



(4) 



i.e., by evenly distributing the power allocated to that user onto 
his individual modes, so Qk — Afclr^ Vfc with the sum-power 
constraint X]fc=i '"fe^fc — Ptx- Introducing nonnegative weight 
factors wi, . . . , wk for the rates of the users, the weighted sum 
rate asymptotically reads as 



K 



fc=i 



K 

fc=i 



Y,^kRk = Y.Wk{rk log2Afc-log2|-Bj(if "if) 



,A' 



Subject to the sum power constraint X]fc=i ^fc-^fc = -Ptx, 
weighted sum rate in (|5]l is maximized for 



Afc- 



Wk 



Lfci Wirt 



Tx, 



(5) 



the 



(6) 



so the power is allocated to the users according to their weights 
(similar to the single-antenna case proven in [8]), and every 
user evenly distributes his fraction of power onto his modes. 
In the case of identical weights Wk = 1 Vfc, the conventional 
sum rate asymptotically reads as 

K 



K 

E 

fc=i 



Rk = r log2 Ptx-t log2 r-^ log2 



{H'^H) 



fc=i 



and is achieved with Qk = P^^jr ■ I^^ Vfc. So, 



(7) 
we are able 



to quantify the asymptotic sum rate that can be achieved by 
means of linear filtering for every single channel realization 
and antenna/user profile in terms of the transmit power Pxx 
and the channel itself. In principle, the ergodic rate and the 
ergodic rate offset to dirty paper coding can be obtained by 
averaging corresponding to any distribution of the channel. 
In [9], results on the ergodic rate offset with respect to dirty 
paper coding were presented for the specific case of Rayleigh 
fading only, where the channel entries of H\ , . . . , Hk all have 
the same distribution. Simple near-far effects with different 
average channel powers for example cannot be captured due 
to this restricting assumption. Moreover, the instantaneous rate 
offset expression is given by means of bases representing null 
spaces of shortened channel matrices taken from [11] and not 
as a function of the channel purely as we do in (|7]i. 

Concerning the asymptotic rate expressions, we have now 
created a smooth transition from the r single-antenna-users 
system configuration in [5] where no cooperation exists be- 
tween the antenna elements at the terminals, to the single- 
user point-to-point MIMO link where all r antennas fully 



cooperate, see [1] for example. In between, we can now 
specify any antenna/user profile we want and compute the 
feasible rate in the asymptotic limit. Using dirty paper coding, 
the asymptotic sum rate reads as 



K 
Y^ nDPC 



fe=i 



Ri. 



r log2 Ptx - r log2 r + logj I H^H I 



(8) 



and corresponds to the rate of the fully cooperating point-to- 
point link [9]. Combining (O and ^, the rate loss AR = 
X]fc=i(^fc^'~' — Rk) of optimal linear filtering with respect to 
optimal dirty paper coding reads as 



K 

AR9,Y,^og^\El{H^Hy 
fc=i 



'E, 



logJiH^'H)- 



(9) 



which of course vanishes, if all channels are pairwise orthog- 
onal, i.e., if H^H is block-diagonal. Of course, a block-type 
Hadamard inequality quickly leads to the inequality 



k=l 

SO linear filtering is obviously inferior to dirty paper coding. 



IV. Optimum Signalling in the BC 

Using our rate duality in [10], we can convert the simple 
solution for the covariance matrices in the dual MAC to 
covariance matrices in the BC, where the Hermitian channels 
are applied. Since this duality explicitly uses the receive filters 
in the MAC as scaled transmit matrices in the BC, we first 
compute the MMSE receivers in the dual MAC, as they are 
optimum and generate sufficient statistics. The receiver Gk for 
user k in the dual MAC reads as 



Gk 



E'^T^H^il 



N ■ 



HTT^H^) 



With the asymptotically optimum precoders Tk — ^/Pr^/rlrf. , 
above expression asymptotically converges to 

-1 . 



Gu^^VlP^^-El{H'^H) 'h^ 



(10) 



Let Pfc denote the precoder of user k in the BC, then the ith 
column pk,i of P^ follows from the conjugate ith row g^^ of 
the matrix GJ. = W^Gk via (see [10]) 



Pk,. 



O^kag'k, 



ak.i 



/fW^ 



H{H'^H) ^EkWkB,, (11) 



where the scaling factor ak,i is obtained by the duality 
transformation and Wk is a unitary decorrelation matrix. 
Since we convert only the asymptotically optimum transmit 
precoders and receive filters, the duality transformation from 
the MAC to the BC in [10] drastically simplifies and can even 
be computed in closed form. In particular, the matrices Ma,b in 
[10, Eq. (23)] vanish for a / & yielding a diagonal matrix M 
and therefore, the scaling factors read as 



ttfci = 



\Wk,h 



(12) 



In combination with ( fTTI ). the ith column of the precoder 
associated to user k reads as 



Pk. 



= V^Tx/r 



H(H^H) EkWkB,, 



(13) 



\H{H^H) ^EkWke^W, 
generating the precoder matrix 

Pk = /fW^ • H{H''Hy^EkWkD-\ 
where the ith diagonal element of the diagonal matrix Dk is 

[Dk]^,^ = ^ eJW^E^{H^Hy^ EkWkB,. (14) 

We can immediately see, that the precoding filters in ( [13] ) lead 
to a block diagonalization of the transmission, since Hf^Pk — 
holds for k ^ i. Next, the decorrelation matrix Wk which 
enables the duality is usually chosen as the eigenbasis of 
GkHkTk — Irfc, which asymptotically coincides with the 
identity matrix due to dTOl l. Since all eigenvalues are identical 
to one, the decorrelation matrices Wk are not given a priori, 
but can easily be computed such that the BC features the same 
sum rate as the dual MAC. By means of ( [T3] ) and the block 
diagonalization property of the precoders, we obtain for user 
/c's receive signal 



Vk 



HfPkSk + m = VPT^/r ■ WkD^^Sk + Vk, (15) 

where rjk £ C^'' is the noise and Sk the symbol vector of 
user k both having an identity covariance matrix. From dTSl) . 
the rate of user k achieved in the BC reads as 

Rk = log2 I,., + PTx/r • WkD^^W^ 

which asymptotically converges to 

Rk = rk log2 Ptx - rk logs r - loga \DI\. (16) 

Above expression is maximized, if we choose Wk as the 
unitary eigenbasis of E'^{H^H)^^Ek, see ([T4l l. such that 
Z)^ contains the eigenvalues, i.e., the elements of D'^ are 
as different as possible. Thus, the transmit covariance matrix 
Sk = PkP^ of user k reads as 



Sk = ^- H+^EkiEnn^'Hr^Ek 



r'E^H^ 



(17) 



with the channel pseudo-inverse if + = {H^H)^^H^. Note 
that rk eigenvalues of Sk are Prx/r whereas the remaining 
N — rk ones are zero. Thus, Sk is a weighted orthogonal 
projector Furthermore, tr{Sk) — PTx/r V/c, so the power is 
uniformly allocated to the individual users in the broadcast 
channel as well. Comparing ( fTTb with the simple solution of 
the transmit covariance matrix Qk = Prx/r ■ Ir^ in the dual 
MAC, it becomes obvious that the optimum covariance matri- 
ces are much more difficult to find directly in the BC without 
using the rate duality, than in the dual MAC. Plugging the 
optimum D^ containing the eigenvalues of E"^ {H^ H)^^ Ek 
into JTSI l finally yields 

Rk = rk log2 Ptx - rk logs r - logs \Ej{H^H)-^Ek \ . 

Hence, the maximum sum rate (|7]) in the dual MAC is also 
achieved in the BC. 



V. Ergodic Rate Expressions 

In this section, we derive expressions for the asymptotic sum 
rate when averaging over the channel reahzations. The simple 
channel model in [9], [5] is a prerequisite for the application 
of the ergodic analysis due to the fact that an instantaneous 
analysis is not possible there. We choose a more realistic 
channel where near-far effects and channel correlations at the 
terminals are modeled as well, i.e., the individual users can 
also have different average channel powers. Thanks to our 
closed form expression of the maximum asymptotic rate for 
an instantaneous channel realization, the following ergodic 
analysis is basically feasible for any distribution of the channel 
coefficients. The channel matrices of the chosen near-far 
channel model with transmit correlations (in the MAC) are 
defined by Hk — H^C^ Vfc, where the elements of Hk are 
uncorrelated and share a zero-mean i.i.d. Gaussian (distribution 
with variance one, and the Hermitian matrix C^ contains 
the correlations. An uncorrelated channel purely modeling the ^ ' ^ 

near-far effect can be obtained by setting Ck ~ Cfcl^j., where 
cj: > is then the inverse path loss of user k. Let the r x r 
matrix C be defined via 

K 



By means of ( fT9l ) and ( 1211 1. averaging over the asymptotic rate 
difference AR in (|9]l between linear filtering and DPC yields 



K rfc-1 



E[AR] - — [j2^{N~e)-J2 E HN-r+rk-e) 



e=o 



fe=l £=0 



(22) 



from which we can observe that the near-far effect with differ- 
ent path losses and channel correlations does not influence the 
rate difference, since both DPC and linear filtering are affected 
in the same way. 

The general expression ( l22b for the ergodic rate loss E[Z\i?] 
can be simplified by means of ( |20] |. when all users are 
equipped with the same number of antennas. For the first 
special case, assume that each user has f > 1 antennas, i.e., 
ri = . . . = rK — r, such that the total number of antennas 
therefore is r = Kf. After some manipulations, we obtain 



1 
1^ 



r(K~l)f 

y - 



Z^ N - 



Kf + e 



(23) 



C = blockdiagjCfc}^ 
then the frequently arising inverse of H^H reads as 

where H^H ~ Wr{N,Ir) has a Wishart distribution with 
N degrees of freedom and {H^H)~^ - W'^iN, I^) has an 
inverse Wishart distribution, see [12], [13]. Thus, the ergodic 
value for the channel dependent log-summand in the DPC sum 
rate expression dHJ reads as [14] 

_. r-l K 

E[\og,\H^H\] = Y.^{N-£) + Y,iogyCk\, (19) 



which coincides with the results in [8], [9], but is a different 
representation. For convenience, we assume that the summa- 
tion vanishes if the upper limit of a sum is smaller than the 
lower one, which happens for f = 1. In this second special 
case with single antenna receivers, i.e., ?" = 1 = r^ V/c and 
r = K, the second sum in ( |23] | consequently vanishes, and 
the ergodic rate loss simplifies to 



1 



K-l 



E\AR] = / 

e.=\ 



(24) 



which is also a result of [5]. 



^=0 



fc=i 



where the Digamma-function ?/)(•) with integer arguments is 
defined via [14] 

Mn + 1) = V(") + - if n e N, MX) = -7, (20) 

n 

and 7 is the Euler-Mascheroni constant. Note from iT% 
that different path losses and correlations in the channel 
coefficients simply lead to a shift of the asymptotic rate 
curve. Concerning the rate expressions with linear filtering, 
we exploit the property that the fcth main diagonal block of 
(H^H)^^ is also inverse Wishart [13]: 

E^iH^H)-'Ek ^ W-^'{N - r + rfc,I,J, 

In combination with E^^C^^ = C^ ^-Ejf, this leads to the 
ergodic expression 



E[log2|C, 



^El{H''H)-'EkC;--\ 



Tk-l 



log2 \Ck\ -T^Y.^{N-r + rk-l). 



(21) 



«=o 



VI. Numerical Examples 

In Table m we present the ergodic rate loss of linear filtering 
with respect to dirty paper coding for different parameters N, 
K, f, ri, and r2, where we employed ( |23] l and ( l24l l for the 
case f = ri = . . . = rK (cf. [5], [8], [9]) and (|22] | for the case 
of different numbers of antennas ri, r2- It can be seen that a 
fully loaded single antenna system with K — N and f — 1 
has to face a significant rate reduction when switching from 
nonlinear to linear filtering. Moreover, comparing the K — 2 
and f = 3 system with the one where K — i and f = 2, we 
observe that the rate loss in the first system is only 65 percent 
of the one in the second system for A^ = 6. We can infer that 
fewer terminals with many antennas have to face smaller rate 
losses than many terminals with only few antennas. 

Next, we plot the ergodic sum capacity with DPC and the 
ergodic sum rate when linear filtering is applied versus the 
transmit power Ptx to see how large Ptx must be to let the 
asymptotic affine approximations become tight. To this end, 
we choose a system configuration where K ^ 2 users each 
having f = 2 antennas are served by an iV = 5 antenna 
base station. Different path losses are modeled by setting 
Ci = I2 and C2 = 2 • I2, i.e., user 2 has a stronger channel 
on average, and we averaged over 1000 channel realizations. 
While the DPC sum capacity can easily be computed via the 
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TABLE I 

Asymptotic ERGODic rate loss E[Z\J?] in bits/s/Hz; A^ antennas 
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Fig. 1. Ergodic .sum rate of linear filtering and DPC and their respective 
affine approximations with K = 2, N = 5, and ri = r2 = f = 2. 



algorithms in [15] or [16], an algorithm proven to reach the 
maximum sum rate under linear filtering does not seem to be 
available yet. Hence, we utilize our combinatorial approach 
in [17], which obtains the best sum rate hitherto known in 
the case of linear filtering. Fig. [T] shows that the asymptotic 
affine approximations become tight already for Ptx smaller 
than 20dB and confirms the asymptotic ergodic rate loss 
E[Z\i?] ~ 2.04 from Table U which is independent of the 
different average channel powers. For a multiplexing gain of 
r = 4 as in the chosen system configuration, this translates 
to an asymptotic power loss of 1.54dB of linear filtering with 
respect to DPC. 

VII. Conclusion 

In this paper, we derived the asymptotic sum capacity which 
is maximally achievable with linear filtering in the broadcast 
channel by means of our rate duality linking the rate region 
of the multiple access channel with the broadcast channel 
rate region. Due to the closed form expression of the asymp- 
totic sum capacity for every single channel realization, the 
instantaneous rate loss with respect to dirty paper coding was 
presented, and the ergodic rate loss can quickly be computed 
or simulated for any distribution of the fading process. As an 
example, we presented the solution of the ergodic rate loss 
for a simple fading model incorporating the near-far effect 
and correlations at the mobiles. Another key result proven 
is that block-diagonalization is the asymptotically optimum 
transmission strategy in the broadcast channel. 
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